FIG.1 

ADAMTS-E nucleotide sequence [SEQ ID NO: 1] 

CACGCGTCCGACGGCGCGGAGGCCCCGGGCGCGGCGCAGGAGCCCGGTGAT 
5 GCTGCGAAGGCTGTGAACAGGGGAGGCGGCACTGTGGGGGCTGCCGGCAGCCGGGG 
CTGGGGAGAGACATGTGGACACGTGGCCTCTATGGCTCCCGCCTGCCAGATCCTCCGC 
TGGGCCCTCGCCCTGGGGCTGGGGGTCATGTTCGAGGTCACGCACGCCTTCCGGTCTC 
AAGATGAGTTCCTGTCCAGTCTGGAGAGCTATGAGATCGCCTTCCCCACCCGCGTGGAC 
CACAACGGGGCACTGCTGGCCTTCTCGCCACCTCCTCCCCGGAGGCAGCGCCGCGGC 

1 0 ACGGGGGCCAGAGCCGAGTGCCGGCTCTTCTACAAAGTGGCCTCGCCCAGCACCCACT 
TCCTGCTGAACCTGACCCGCAGCTCCCGTCTACTGGCAGGGCAGGTCTCCGTGGAGTA 
CTGGACACGGGAGGGCCTGGCCTGGCAGAGGGCGGCCCGGCGCCACTGCCTCTACGC 
TGGTCACCTGCAGGGCCAGGCCAGCAGCTCCCATGTGGCCATCAGCACCTGTGGAGGC 
CTGCACGGCCTGATCGTGGCAGACGAGGAAGAGTACCTGATTGAGCCCCTGCACGGTG 

1 5 GGCCCAAGGGTTCTCGGAGCCCGGAGGAAAGTGGACCAGATGTGGTGTACAAGCGTTC 
CTCTCTGCGTCACCGCCACCTGGACACAGCCTGTGGAGTGAGAGATGAGAAACCGTGG 
AAAGGGCGGCCATGGTGGCTGCGGACCTTGAAGCCAGCGCCTGCCAGGCCCCTGGGG 
AATGAAACAGAGCGTGGCCAGCCAGGGCTGAAGCGATCGGTCAGGGGAGAGCGCTACG 
TGGAGAGGGTGGTGGTGGCTGACAAGATGATGGTGGCCTATCACGGGCGCCGGGATGT 

20 GGAGCAGTATGTCCTGGCCATCATGAACATTGTTGCCAAACTTTTCCAGGACTCGAGTCT 
GGGAAGCACCGTTAACATCCTCGTAACTCGCCTCATCCTGGTCACGGAGGACCAGCCCA 
CTCTGGAGATCACCCACCATGCCGGGAAGTCCCTGGACAGCTTCTGTAAGTGGCAGAAA 
TCCATCGTGAACCACAGCGGCCATGGCAATGCCATTCCAGAGAACGGTGTGGCTAACCA 
TGACACAGCAGTGCTCATCACACGCTATGACATCTGCATCTACAAGAACAAACCCTGCG 

25 GCACACTAGGCCTGGCCCCGGTGGGCGGAATGTGTGAGCGCGAGAGAAGCTGCAGCG 
TCAATGAGGACATTGGCCTGGCCACAGCGTTCACCATTGCCCACGAGATCGGGCACACA 
TTCGGCATGAACCATGACGGCGTGGGAAACAGCTGTGGGGCCCGTGGTCAGGACCCAG 
CCAAGCTCATGGCTGCCGACATTACCATGAAGACCAACCCATTCGTGTGGTCATCCTGC 
AGCCGTGACTACATCACGAGCTTTCTAGACTCGGGCCTGGGGCTCTGCCTGAACAACCG 

30 GCCCCCCAGACAGGACTTTGTGTACCCGACAGTGGCACCGGGCCAAGCCTACGATGCA 
GATGAGCAATGCCGCTTTCAGCATGGAGTCAAATCGCGTCAGTGTAAATACGGGGAGGT 
CTGCAGCGAGCTGTGGTGTCTGAGCAAGAGCAACCGGTGCATCACGAACAGGATCCCG 
GCCGCCGAGGGCACGCTGTGCCAGACGGACACCATCGACAAGGGGTGGTGCTACAAAC 
GGGTCTGTGTCCCCTTTGGGTCGCGCCCAGAGGGTGTGGACGGAGCCTGGGGGCCGT 

35 GGACTCCATGGGGCGAGTGCAGCCGGACCTGTGGCGGCGGCGTGTCCTCTTCTAGCC 
GTCACTGCGACAGCCCCAGGCCAACCATCGGGGGCAAGTACTGTCTGGGTGAGAGAAG 
GCGGCAGGGCTCCTGCAACACGGATGACTGTCCCCCTGGCTCGCAGGACTTCAGAGAA 



EXPRESS MAIL NO.^^'^t^c^'^ us 



GTGCAGTGTTCTGAATTTGACAGCATCCCTTTCCGTGGGAAATTCTACAAGTGGAAAACG 
TAGCGGGGAGGGGGCGTGAAGGCCTGCTCGCTCACGTGCCTAGCGGAAGGCTTCAACT 
TCTACACGGAGAGGGCGGCAGCCGTGGTGGACGGGACACCCTGCCGTCCAGACACGG 
TGGACATTTGCGTCAGTGGCGAATGCAAGCACGTGGGCTGCGACCGAGTCCTGGGCTC 
5 CGACCTGGGGGAGGACAAGTGCCGAGTGTGTGGCGGTGACGGCAGTGCCTGCGAGAC 
CATCGAGGGCGTCTTCAGCCCAGCCTCACCTGGGGCCGGGTACGAGGATGTCGTCTGG 
ATTCCCAAAGGCTCCGTCCACATCTTCATCCAGGATCTGAACCTGTCTCTCAGTCACTTG 
GCCCTGAAGGGAGACCAGGAGTCCCTGCTGCTGGAGGGGCTGCCCGGGACCCCCCAG 
CCCCACCGTCTGCCTCTAGCTGGGACCACGTTTCAACTGGGACAGGGGGCAGACCAGG 

1 0 TCCAGAGCGTCGAAGCCCTGGGAGCGATTAATGCATCTCTCATCGTCATGGTGCTGGCC 
CGGACGGAGCTGCCTGCCCTCCGCTACCGCTTCAATGCCCCCATCGCCCGTGACTCGC 
TGCCCCCCTACTCCTGGCACTATGCGCCCTGGACCAAGTGCTCGGCCCAGTGTGCAGG 
CGGTAGCCAGGTGCAGGCGGTGGAGTGCCGCAACCAGCTGGACAGCTGGGCGGTCGC 
CCGCCACTACTGCAGTGCCGACAGCAAGCTGCCCAAAAGGCAGCGCGCCTGCAACACG 

1 5 GAGCCTTGCCCTCCAGACTGGGTTGTAGGGAAGTGGTCGCTCTGCAGCCGCAGCTGCG 
ATGCAGGCGTGCGCAGCCGCTCGGTCGTGTGCCAGCGCCGCGTCTCTGCCGCGGAGG 
AGAAGGCGCTGGACGACAGCGCATGCCCGCAGCCGCGCCCACCTGTACTGGAGGCCT 
GCCACGGCCCCACTTGCCCTCCGGAGTGGGCGGCCCTCGACTGGTCTGAGTGCACCC 
CCAGCTGCGGGCCGGGCCTCCGCCACCGCGTGGTCCTTTGCAAGAGCGCAGACCACG 

20 GCGCCACGCTGCCCCCGGCGCACTGCTCACCCGCCGCCAAGCGACCGGCCACCATGC 
GCTGCAACTTGCGCCGCTGCCCCCCGGCCCGCTGGGTGGCTGGCGAGTGGGGTGAGT 
GCTCTGCACAGTGCGGCGTCGGGCAGCGGCAGCGCTCGGTGCGCTGCACCAGCCACA 
CGGGCCAGGCGTCGCACGAGTGCACGGAGGCCCTGCGGCCGCCGACTACCACGCAGC 
AGTGTGAGGCCAAGTGCGAGAGCCCAACCCGCGGGGAGGGGCCTGAAGAGTGCAAGG 

25 ATGTGAACAAGGTCGCCTACTGCCCCCTGGTGCTCAAATTTCAGTTCTGCAGCCGAGGC 
TACTTCCGCCAGATGTGCTGCAAAACCTGCCAGGGCCACTAGGGGGCGCGCGGCACCC 
GGAGCCACAGCTGGCGGGGTCTCCGCCGCCAGCCCTGCAGCGGGCCGGCCAGAGGG 
GGCCCCGGGGGGGGCGGGAACTGGGAGGGAAGGGTGAGACGGAGCCGGAAGTTATTT 
ATTGGGAACCCCTGCAGGGCCCTGGCTGGGAGGATCCACCCCAACCTCTGCCGTGGCC 

30 GCCCCAGGGGCACCCCGACATCCAGGCCACCCCCTCATGGTGCTACAGACCCTGGCCT 
GGGGCCCACACACTCCTGCCAGGAAGCCCTACATCAATAAAGTTCTGTCTTGTGTAGAT 
TTCTAAAAAAAAAAAAAAA 



F!G2 

ADAMTS-E amino acid sequence [SEQ ID NO: 2] 

MAPACQILRWALALGLGLMFEVTHAFRSQDEFLSSLESYEIAFPTRVDHNGALLAFS 

PPPPRRQRRGTGATAESRLFYKVASPSTHFLLNLTRSSRLLAGHVSVEYWTREGLA 

WQRAARPHCLYAGHLQGQASTSHVAISTCGGLHGLIVADEEEYLIEPLHGGPKGSR 

SPEESGPHWYKRSSLRHPHLDTACGVRDEKPWKGRPWWLRTLKPPPARPLGNE 

TERGQPGLKRSVSRERYVETLWADKIViMVAYHGRRDVEQYVLAIIVlNIVAKLFQDSS 

LGSTVNILVTRLILLTEDQPTLEITHHAGKSLDSFCKWQKSIVNHSGHGNAIPENGVA 

NHDTAVLITRYDICIYKNKPCGTLGLAPVGGMCERERSCSVNEDIGLATAFTIAHEIG 

HTFGMNHDGVGNSCGARGQDPAKLMAAHITMKTNPFNM/SSCSRDYITSFLDSGLG 

LCLNNRPPRQDFWPTVAPGQAYDADEQCRFQHGVKSRQCKYGEVCSELWCLSK 

SNRCITNSIPAAEGTLCQTHTIDKGWCYKRVCVPFGSRPEGVDGAWGPWTPWGDC 

SRTCGGGVSSSSRHGDSPRPTIGGKYCLGERRRHRSCNTDDCPPGSQDFREVQC 

SEFDSIPFRGKFYKWKTYRGGGVKACSLTCLAEGFNFYTERAAAWDGTPCRPDTV 

DICVSGECKHVGCDRVLGSDLREDKCRVCGGDGSACETIEGVFSPASPGAGYEDV 

VWIPKGSVHIFIQDLNLSLSHLALKGDQESLLLEGLPGTPQPHRLPLAGTTFQLRQGP 

DQVQSLEALGPINASLIVMVLARTELPALRYRFNAPIARDSLPPYSWHYAPWTKCSA 

QCAGGSQVQAVECRNQLDSSAVAPHYCSAHSKLPKRQRACNTEPCPPDWWGN 

WSLCSRSCDAGVRSRSWCQRRVSAAEEKALDDSACPQPRPPVLEACHGPTCPPE 

WAALDWSECTPSCGPGLRHRWLCKSADHRATLPPAHCSPAAKPPATMRCNLRRC 

PPARWVAGEWGECSAQGGVGQRQRSVRCTSHTGQASHECTEALRPPTTTQQGE 

AKCDSPTPGDGPEECKDVNKVAYCPLVLKFQFCSRAYFRQMGGKTCQGH 



Figure 3. Domain structure of ADAMTS-E and translated nucleic acid sequence. A) Diagram of 
ADAMTS-E showing the following domains and signature motifs (with amino acid numbers m 
parentheses): prodomain (1-66), furin cleavage site (63-66), metalloproteinase domain (67-453), zinc- 
binding motif (392-420), disintegrin domain (469-531), TSPl motif (548-601), and four TSPl-like motifs 
(829-884, 888-944, 948-1002, and 1007-1058). Overlapping clones covering the mdtcated sequence 
segments are depicted at the bottom of the diagram. B) ADAMTS-E nucleotide sequence with translated 
amino acid sequence above. 
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ADAMTS-E 

1104 aa 



1 CACGCGTCCG ACGGCGCGGA GGCCCCGGGC GCGGCGCAGG AGCCCGGTGA TGCTGCGAAG GCTGTGAACA GGGGAGGC 
GTGCGCAGGC TGCCGCGCCT CCGGGGCCCG CGCCGCGTCC TCGGGCCACT ACGACGCTTC CGACACTTGT CCCCTCCG 



+ 1 M A P A C Q : 

Prodomam 

81 CACTGTGGGG GCTGCCGGCA GCCGGGGCTG GGGAGAGACA TGTGGACACG TGGCCTCTAT GGCTCCCGCC TGCCAGATCC 
GTGACACCCC CGACGGCCGT CGGCCCCGAC CCCTCTCTGT ACACCTGTGC RCCGGRGATA CCGAGGGCGG ACGGTCTAGG 

+1LRWA LAL GLGL MFE VTH AFRS QDE FL S 

Prodomai.i 



ISL TCCGCTGGGC CCTCGCCCTG GGGCTGGGCC TCATGTTCGA GGTCACGCAC GCCTTCCGGT CTCAAGATGA GTTCCTGTCC 
AGGCGACCCG GGAGCGGGAC CCCGACCCGG AGTACAAGCT CCAGTGCGTG CGGAAGGCCA GAGTTCTACT CAAGGACAGG 

+1SLES YEI AFP TRVD HNG ALL AFSP PP? 
Prodomain 

241 AGTCTGGAGA GCTATGAGAT CGCCTTCCCC ACCCGCGTGG ACCACAACGG GGCACTGCTG GCCTTCTCGC CACCTCC7CC 

TCAGACCTCT CGATACTCTA GCGGAAGGGG TGGGCGCACC TGGTGTTGCC CCGTGACGAC CGGAAGAGCG GTGGAGGAGG 



■fi RRQ RRGT GAT AES RLFY KVA SP3 TnT 
Turin Cleavage Site (Motif) 

Prodomam Metal loprotemase Lca-' 

321 CCGGAGGCAG CGCCGCGGCA CGGGGGCCAC AGCCGAGTCC CGCCTCTTCT ACAAAGTGGC CTCGCCCAGC --ZZf-Z^-ZI 
GGCCTCCGTC GCGGCGCCGT GCCCCCGGTG TCGGCTCAGG GCGGAGAAGA TGTTTCACCG GAGCGGGTCG TGGGT:.-.-.:: 



-ILLNL TRS SRLL AGH VSV EYWT REG 1^.-- 

Metalloprotemase 2c-ra.~. 

4 01 TGCTGAACCT GACCCGCAGC TCCCGTCTAC TGGCAGGGCA CGTCTCCGTG GAGTACTGGA CACGGGAGGG CCTGGCCrGG 

acgacttgga ctgggcgtcg agggcagatg accgtcccgt gcagaggcac ctcatgacct gtgccctccc ggaccgg;^:? 



+ 1QRAA RPH CLY AGHL QGQ AST SH7A 

Metalloproteir.ase 



481 cagagggcgg cccggcccca ctgcctctac gctggtcacc tgcagggcca ggccagcacc tcccatgtgg ccatcp-gcac 

GTC7CCCGCC GGGCCGGGGT GACGGAGATG CGACCAGTGG ACGTCCCGGT CCGGTCGTGG AGGGTACACC GGTAGTCGTG 



+1. cgg lhgl iva dee eyli epl hgg 

Metalloprotemase 



551 CTGTGGAGGC CTGCACGGCC TGATCGTGGC AGACGAGGAA GAGTACCTGA TTGAGCCCCT GCACGGTGGG CCCAAGGGTr 
GACACCTCCG GACGTGCCGG ACTAGCACCG TCTGCTCCTT CTCATGGACT AACTCGGGGA CGTGCCACCC GGGTTCCCAA 



+ 1SRSP EES GPHV VYK RSS LRHP HLD TA~ 

Metalloprotainase Oc^'air. 

541 CTCGGAGCCC GGAGGAAAGT GGACCACATG TGGTGTACAA GCGTTCCTCT CTGCGTCACC CCCACCTGGA CACAGCCTGT 

gagcctcggg cctcctttca cctggtgtac accacatgtt cgcaaggaga gacgcagtgg gggtggacct gtgtcggaca 
+ 1gvrd ekp wkg rpww lrt lkp ppar ? z 3 

Metalloprotexnase Oo.-r.air. 

721 ggagtgagag atgagaaacc gtggaaaggg cggccatggt ggctgcggac cttgaagcca ccgcctgccr ggcccctggg 
cctcactctc tactctttgg cacctttccc gccggtacca ccgacgcctg gaacttcggt ggcggacggt ccggggaccc 

+ 1 NET ERGQ PGL KRS VSRE RYV ETL VVA 

Metalloproteanase Domain 

801 GAATGAAACA GAGCGTGGCC AGCCAGGCCT GAAGCGATCG GTCAGCCGAG AGCGCTACGT GGAGACCCTG GTGGTGGCTG 
CTTACTTTGT CTCGCACCGG TCGGTCCGGA CTTCGCTAGC CAGTCGGCTC TCGCGATGCA CCTCTGGGAC CACCACCGAC 

+ 1DKMM VAY HGRR DVE QYV LAIM NIV AKI. 
Metalloproteinase Domain 

881 ACAAGATGAT GGTGGCCTAT CACGGGCGCC GGGATGTGGA GCAGTATGTC CTGGCCATCA TGAACATTGT TGCCAAACTT 
TGTTCTACTA CCACCGGATA GTGCCCGCGG CCCTACACCT CGTCATACAG GACCGGTAGT ACTTGTAACA ACGGTTTGAA 



rlFQDS SLG STV NILV TRL ILL TEDQ ?T1 
Metalloproteinase Doraain 

961 TTCCAGGACT CGAGTCTGGG AAGCACCGTT AACATCCTCG TAACTCGCCT CATCCTGCTC ACGGAGGACC AGCCCACTrT 
AAGGTCCTGA GCTCAGACCC TTCGTGGCAA TTGTAGGAGC ATTGAGCGGA GTAGGACGAG TGCCTCCTGG TCGGGTGAGA 

+ 1 EIT HHAG KSL DSF CKWQ KSI VNH G- 
Metalloprotemase Domain 

1041 GGAGATCACC CACCRTGCCG GGAAGTCCCT GGACAGCTTC TGTAAGTGGC AGAAATCCAT CGTGAACCAC AGC3GCCAT3 
CCTCTAGTGG GTGGTACGGC CCTTCAGGGA CCTGTCGAAG ACATTCACCG TCTTTAGGTA GCACTTGGTG 7CGCCGGTAC 

+ IGNAI PEN GVAN HDT AVL ITRY DIG lYK 
Metalloproteinase Domain 

1121 GCAATGCCAT TCCAGAGAAC GGTGTGGCTA ACCATGACAC --GCAGTCCTC ATCACACGCT ATGACATCTG CATCTACAiG 
CGTTACGGTR AGGTCTCTTG CCACACCGAT TGGTACTGTG . ZGTCACGAG TAGTGTGCGA TACTGTAGAC GTAGATGrrc 

+ 1NKPC GTL GLA PVGG MCE RER SCSV NSD 
Metalloprotei-nase Domain 

1201 AACAAACCCT GCGGCACACT AGGCCrGGCC CCGGTGGGCG GAATGTGTGA GCGCGAGAGA AGCTGCAGCG TCAATGAGGA 
TTGTTTGGGA CGCCGTGTGA TCCGGACCGG GGCCACCCGC CTTACACACT CGCGCTCTCT TCGACGTCG C AGTrACTCCT 

■(-1 IGL ATAF TIA HEI GHTF GMN HDG VGM 

2inc-bindip.g Met.; 

Metalloproteinase Domain 

12 81 CATTGGCCTG GCCACAGCGT TCACCATTGC CCACGAGATC GGGCACACAT TCGGCATGAA CCATGACGGC GTGGGAAACA 
GTAACCGGAC CGGTGTCGCA AGTGGTAACG GGTGCTCTAG CCCGTGTGTA AGCCGTACTT GGTACTGCCG CACCCTTTGT 

+ 1SCGA RGQ DPAK LMA AHI TMKT NPF VW3 

Zinc-binding Motif 

Metalloproteinase Domain 

1361 GCTGTGGGGC CCGTGGTCAG GACCCAGCCA AGCTCATGGC TGCCCACATT ACCATGAAGA CCAACCCATT CGTGTGGTCA 
CGACACCCCG GGCACCAGTC CTGGGTCGGT TCGAGTACCG ACGGGTGTAA tGGTACTTCT GGTTGGGTAA GCACACCAGT 

+ 13CSR DYI TSF LDSG LGL CLN NRPP RQD 

Metalloproteinase Domain 

1441 TCCTGCAGCC GTGACTACAT CACCAGCTTT CTAGACTCGG GCCTGGGGCT CTGCCTGAAC AACCGGCCCC CCAGACAGGA 
AGGACGTCGG CACTGATGTA GTGGTCGAAA GATCTGAGCC CGGACCCCGfl GACGGACTTG TTGGCCGGGG GGTCTGTCCT 

+1 FVY PTVA PGQ AYD ADEQ CRF QHG VK3 

Dismtegrin DomaLn 



1521 CTTTGTGTAC CCGACAGTGG CACCGGGCCA AGCCTACGRT GCAGATGRGC AATGCCGCTT TCAGCATGGA GTCAAATCGC 
GAAACACATG GGCTGTCACC GTGGCCCGGT TCGGATGCTA CGTCTACTCG TTACGGCGAA AGTCGTACCT CAGTTTAGCG 



CK YGE VCSE LWC LSK SNRC IT 
Disintegrin Domain 



1601 GTCAGTGTAA ATACGGGGAG GTCTGCAGCG AGCTGTGGTG TCTGAGCAAG AGCAACCGGT GCATCACCAA CAGCATCGCG 
CAGTCACATT TATGCCCCTC CAGACGTCGC TCGACACCAC AGACTCGTTC TCGTTGGCCA CGTAGTGGTT GZZOZ-'ZZZZ 

+ 1AAEG TLC QTH TIDK GWC YKR VCV? r^^^S 
Disincegrin Domain 

1681 GCCGCCGAGG GCACGCTGTG CCAGACGCAC ACCATCGACA AGGGGTGGTG CTACAAACGG GTCTGTGTCC CCTTTGGGTG 

CGGCGGCTCC CGTGCGACAC GGTCTGCGTG TGGTAGCTGT TCCCCACCAC GATGTTTGCG CAGACACAGG GGAAACCCAG 

+ 1 RPE GVDG AWG PWT PWGD CSR TCG G3V 



17 61 GCGCCCAGAG GGTGTGGACG GAGCCTGGGG GCCGTGGACT CCATGGGGCG ACTGCAGCCG GACCTGTGGC GGCGGC: 

CGCGGGTCTC CCACACCTGC CTCGGACCCC CGGCACCTGA GGTACCCCGC TGACGTCGGC CTGGACACCG CGGCCG' 

+ 1SSSS RHC DSPR PTI GGK YCLG ERR RH 
TSPl Motif 

18 41 CCTCTTCTAG CCGTCACTGC GACAGC£CCA GGCCAACCAT CGGGGGCAAG TACTGTCTGG GTGAGAGAAG GCGGCA 





GGAGAAGATC 


GGCAGTGACG 


CTGTCGGGGT 


CCGGTTGGTA 


GCCCCCGTTC 


ATGACAGACC 


CACTCTCTTC 


CGCCGTGGCG 


+1 


S C N T 


D D C 


PPG 


S Q D f 


REV 


Q C S 


E F D S 


I ? F 




TSPl Motif 


































1921 


TCCTGCAACA 


CGGATGACTG 


TCCCCCTGGC 


TCCCAGGACT 


TCAGAGAAGT 


GCAGTGTTCT 


GAATTTGACA 






AGGACGTTGT 


GCCTACTGAC 


AGGGGGACCG 


AGGGTCCTGA 


AGTCTCTTCA 


CGTCACAAGA 


CTTAAACTGT 






R G K 


F Y K W 


I K T y 


R G G 


G V K P. 


C S L 


T C L 


AEG 


2001 


CCGTGGGAAA 


TTCTACAAGT 


GGAAAACGTA 


CCGGGGAGGG 


GGCGTGAAGG 


CCTGCTCGCT 


CACGTGCCTA 






GGCACCCTTT 


AAGATGTTCA 


CCTTTTGCAT 


GGCCCCTCCC 


CCGCACTTCC 


GGACGAGCGA 


GTGCACGGAT 


CGCCTTCCGA 


+ 1 


F N r V 


T E R 


A A A V V D G 


T P C 


R P D T V D I 


CVS 


2081 


TCAACTTCTA 


CACGGAGAGG 


GCGGCAGCCG 


TGGTGGACGG 


GACACCCTGC 


CGTCCAGACA 


CGGTGGACAT 


TTGCGTCAGT 




AGTTGAAGAT 


GTGCCTCTCC 


CGCCGTCGGC 


ACCACCTGCC 


CTGTGGGACG 


GCAGGTCTGT 


GCCACCTGTA 


AACGCAGTCA 


4-1 


G E C K H V G 


C D R 


V L G S D L R 


E D K 


C R V C 


: G G D 


2161 


GGCGAATGCA 


AGCACGTGGG 


CTGCGACCGA 


GTCCTGGGCT 


CCGACCTGCG 


GGAGGACAAG 


TGCCGAGTGT 


GTGGCGGTGA 




CCGCTTACGT 


TCGTGCACCC 


GACGCTGGCT 


CAGGACCCGA 


GGCTGGACGC 


CCTCCTGTTC 


ACGGCTCACA 


CflCCGCCACT 


+ 1 


G S A 


C E T I E G V 


F S P 


A 3 P G A G Y 


E D V 


7 w : 


2241 


CGGCAGTGCC 


TGCGAGACCA 


TCGAGGGCGT 


CTTCAGCCCA 


GCCTCACCTG 


GGGCCGGGTA 


CGAGGATGTC 






GCCGTCACGG 


ACGCTCTGGT 


AGCTCCCGCA 


GAAGTCGGGT 


CGGAGTGGAC 


CCCGGCCCAT 


GCTCCTACAG 


CAGACCTAAG 


+ 1 


P K G S 


V H I 


F I Q : 


D L N L 


S L S 


H L A : 


L K G D 


Q E S 


2321 


CCAAAGGCTC 


CGTCCACATC 


TTCATCCAGG 


ATCTGAACCT 


CTCTCTCAGT 


CACTTGGCCC 


TGAAGGGAGA 


CCAGGAGTCC 




GGTTTCCGAG 


GCAGGTGTAG 


AAGTAGGTCC 


TAGACTTGGA 


GAGAGAGTCA 


GTGAACCGGG 


ACTTCCCTCT 


GGTCCTCAGG 


+ 1 


L L L 


E G L P 


G T P 


Q P H 


R L P L 


A G T 


T F Q 


L R Q G 


2401 


CTGCTGCTGG 


AGGGGCTGCC 


CGGGACCCCC 


CAGCCCCACC 


GTCTGCCTCT 


AGCTGGGACC 


ACCTTTCAAC 


TGCGACAGGG 




GACGACGACC 


TCCCCGACGG 


GCCCTGGGGG 


GTCGGGGTGG 


CAGACGGAGA 


TCGACCCTGG 


TGGAAAGTTG 


ACGCTGTCCC 



+ 1 PDQ VQSL EAL GPI NASL IVM VLA ?. T£ 
2481 GCCAGACCAG GTCCAGAGCC TCGAAGCCCT GGGACCGATT AATGCATCTC TCATCGTCAT GGTGCTGGCC CGGACCGA:-: 
CGGTCTGGTC CAGGTCTCGG AGCTTCGGGA CCCTGGCTAA TTACGTAGAG AGTAGCAGTA CCACGACCGG GCCTGGCTCG 

+ 1LPAL RYR FKA? lAR DSL PP'fS MHY ^?.-.- 

TSPl-li-:e Xc-.: 

2561 TGCCTGCCCT CCGCTACCGC TTCAATGCCC CCATCGCCCG TGACTCGCTG CCCCCCTACT CCTGGCACTA TGCGCGCTCG 
ACGGACGGGA GGCGATGGCG AAGTTACGGG GGTAGCGGGC ACTGAGCGAC GGGGGGATGA GGACCGTGAT ACGCGGGA:: 

tITKCS AQC AGG SQVQ AVE CRM QLDS 3-V 

TSPl-like Mccir 

2 641 ACCAAGTGCT CGGCCCAGTG TGCAGGCGGT AGCCAGGTGC AGGCGGTGGA GTGCCGCAAC CAGCTGGACA GCTCCGC-3r 
TGGTTCACGR GCCGGGTCAC ACGTCCGCCA TCGGTCCACG TCCGCCACCT CACGGCGTTG GTCGACCTGT CGAGC-CZC:- 

+ 1 APH 'fCSA HSK LPK RQRA CNT EPC ??3 

TSPl-like Motif TSPl-like :-:ocir 

2721 CGCCCCCCAC TACTGCAGTG CCCACAGCAA GCTGCCCAAA AGGCAGCGCG CCTGCAACAC GGAGCCTTGC CCTCCAGACT 
GCGGGGGGTG ATGACGTCRC GGGTGTCGTT CGACGGGTTT TCCGTCGCGC GGACGTTGTG CCTCGGAACG GGAGGTCTGA 

+ 1WVVG NWS LCSR SCD AGV RSRS VVC QR?. 

TSPl-llke Motif 

2801 GGGTTGTAGG GAACTGGTCG CTCTGCAGCC GCAGCTGCGA TGCAGGCGTG CGCAGCCGCT CGGTCGTGTG CCAGC32CC-C 
CCCAACATCC CTTGACCAGC GAGACGTCGG CGTCGACGCT ACGTCCGCAC GCGTCGGCGA GCCAGCACAC GGTCGCGGCG 

+ 1VSAA EEK ALD D3AC PQP RPP VLEA CHG 
TSPl-like Motif 

2881 GTCTCTGCCG CGGAGGAGAA GGCGCTGGAC GACAGCGCAT GCCCGCAGCC GCGCCCACCT GTACTGGAGG CCTGCCAC3G 

CAGAGACGGC GCCTCCTCTT CCGCGACCTG CTGTCGCGTA CGGGCGTCGG CGCGGGTGGA CATGACCTCC GGfiCGGTGCC 

+ 1 PTC PPEW AAL DWS ECTP SCG PGL RH3 
TSPl-like Motif TSPl-llke Mo-__f 

2961 CCCCACTTGC CCTCCGGAGT GGGCGGCCCT CGACTGGTCT GAGTGCACCC CCAGCTGCGG GCCGGGCCTC CGCCACCGCG 
GGGGTGAACG GGAGGCCTCA CCCGCCGGGA GCTGACCAGA CTCACGTGGG GGTCGACGCC CGGCCCGGAG GCGGTGGCGC 

+ 1VVLC KSA DHRA TLP PAH CSPA AKP PAT 
TSPl-like • Motif 

3041 TGGTCCTTTG CAAGAGCGCA GACCACCGCG CCACGCTGCC CCCGGCGCAC TGCTCACCCG CCGCCAAGCC ACCGGCCACC 

ACCAGGAAAC GTTCTCGCGT CTGGTGGCGC GGTGCGACGG GGGCCGCGTG ACGAGTGGGC GGCGGTTCGG TGGCCGG75G 

+ 1MRCN LRR CPP ARWV AGE WGE CSAQ CGV 
TSPl-like Motif TSPl-llke Mocif 

3121 ATGCGCTGCA ACTTGCGCCG CTGCCCCCCG GCCCGCTGGG TGGCTGGCGA GTGGGGTGAG TGCTCTGCAC AGTGCGGCGT 
TACGCGACGT . TGAACGCGGC GACGGGGGGC CGGGCGACCC ACCGACCGCT CACCCCRCTC ACGAGACGTG TCRCGCCGCA 



4-1 GQR QRSV RCT 3HT GQAS HEC TEA 1?? 
TSPl-llke Motif 

3201 CGGGCAGCGG CAGCGCTCGG TGCGCTGCAC CAGCCACACG GGCCAGGCGT CGCACGAGTG CACGGAGGCC CTGCGGCCGC 
GCCCGTCGCC GTCGCGAGCC ACGCGACGTG GTCGGTGTGC CCGGTCCGCA GCGTGCTCAC GTGCCTCC3G 5 ACGCC3GC3 

+ 1PTTT QQC EAKC DSP TPG DGPE ECK DV:: 
TSPl-iike Motif 

3281 CGACTACCAC GCAGCAGTGT GAGGCCAAGT GCGACAGCCC AACCCCCGGG GACGGCCCTG .AAGAGTGCAA GGATGTGA-.-.C 

GCTGATGGTG CGTCGTCACA CTCCGGTTCA CGCTGTCGGG TTGGGGGCCC CTGCCGGGAC TTCT CACGTT CC?-G.-CTT: 

-I-IKVAY CPL VLK FQFC SRA YFR QMCC Y ^ C 
3361 AAGGTCGCCT ACTGCCCCCT GGTGCTCAAA TTTCAGTTCT GCAGCCGAGC ctacttccgc cagatgtgct gcaaaacct; 
TTCCAGCGGA tgacggggga ccacgagttt aaagtcaaga cgtcggctcg gatgaaggcg gtctacacga :gttttgg.-.: 



+ 1 Q G H 

3441 CCAGGGCCAC TAGGGGGCGC GCGGCACCCG GAGCCACAGC TGGCGGGGTC TCCGCCGCCA GCCCTGCAGC GGGCCGGC".- 



GGTCCCGGTG ATCCCCCGCG CGCCGTGGGC CTCGGTGTCG ACCGCCCCAG AGGCGGCGGT CGGGRCGTCG CCCGGCCGGT 



3521 


GAGGGGGCCC 
CTCCCCCGGG 


CGGGGGGGGC 
GCCCCCCCCG 


GGGAACTGGG 
CCCTTGRCCC 


AGGGAAGGGT 
TCCCTTCCCA 


GAGACGGAGC 
CTCTGCCTCG 


CGGAAGTTAT 
GCCTTCAATA 


TTATTGGGAA 
AATAACCCTT 


CCCCTGCAGG 
GGGGACGTCC 


3501 


GCCCTGGCTG 
CGGGACCGAC 


GGAGGATCCA 
CCTCCTAGGT 


CCCCAACCTC 
GGGGTTGGAG 


TGCCCTGCCC 
ACGGGACGGG 


GCCCCAGGGG 
CGGGGTCCCC 


CACCCCGACA 
GTGGGGCTGT 


TCCAGGCCAC 
AGGTCCGGTG 


CCCCTCATGG 
GGGGAGTACC 


3581 


TGCTACAGAC 
ACGATGTCTG 


CCTGCCCTGG 
GGACGGGACC 


GGCCCACACA 
CCGGGTGTGT 


CTCCTGCCAG 
GAGGACGGTC 


GAAGCCCTAC 
CTTCGGGATG 


ATCAATAAAG 
TAGTTATTTC 


TTCTGTCTTG 
AJ^GACAGAAC 





37 61 TAAAAAAAAA AAAAAA 
ATTTTTTTTT TTTTTT 
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Figure 5, Expression of ADAMTS-E in cDNA from osteoarthritic cartilage. 



Figure 6. Amino acid alignment of human ADAMTS-E with a GenScan prediction of ADAMTS-E from mouse genomic 
sequence. 



hADAMrS-E : 1) HASDGAEAPGAAQEPG— DAAKAVNRGGGTVGAAGSRGWGETCGHVASMAPACQILRWALALG 

--AOAMTS-E: ! 1 ) XMCDHSSySPPFypLLLRVGNGQCLPHKVDVKGTSDAAQTVNRGGSTVGAASSRGWGETCGHVAPMASACQILRWALALG 

Consensus :l) G P G DAA VNRGG TVGAA SRGWGETCGHVA MA ACQILRWAIAIG 



hADAMTS-E 
tADAMTS-E 



LGLMFEVTHAFRSQDEFLSSLESYEIAFPTRVDHNGALLAFSPPPPRRQRRGTGATAESRLFYKVASPSTHFLLNLTPSS 
LGLTFKVTHAFRSQDELLSSLESYEIArPTRVDHNGAMLAFSPPAFRRQRRGAGATTESRLFYKVAAPSTHFLLNLTRS? 
LGL F VTHAFRSQDE LSSLES YEI AFPTRVDHNGA LAFSPP RRQRRG GAT ESRLFYKVA ?STHFLLK:.TPS 



h.ADAMTS-E ( 142 ) RLLAGHVSVEYWTREGLAWQRAARPHCLYAGHLQGQASTSHVAISTCGGLHGLIVADEEEYLIEPLKGGPKGSRSPSESG 
mADAMTS-E ( 161 ) RLLAGHVSVEYWTREGLAWQRAAHAHCLYAGHLQGQAGSSHVAVSTCGGUiGLIVADDEEYLIEPLQGGPKGHRGPEESG 

Consensus (161) RLLAGHVSVEYWTREGLAWQRAAR HCLYAGHLQGQA SHVA STCGGLHGLIVAD EEYLIEPL GGPKG R PEESS 

241 3:: 

hADAMTS-E (222 ) PHVVYKRSSLRHPHLDTACGVRDEKPWKGRPW^iLRTLKPPPARPLGKETERGQPGLKRSVSRERVVETLWADKMMVAYH 
rtlADAMTS-E (241) PHWYKRSSLRHSHLDTACGVRDEKPWKGRPWWLRTLKPPPARPLGNESERGQLGLKRSVSRERVVETLWADKMMVAYH 

Consensus (241) PHVVYKRSSLRHPHLDTACGVRDEKPWKGRPWWLRTLKPPPARPLGNE ERGQ GLKRSVSRERYVETLVVADKMKVA r H 

321 4c: 

hADAMTS-E ; 302 ) GRRDVEQYVLAIMNIV AKLFQDSSLGSTVMILVTRLILLTEDQPTLEITHHAGKSLDSFCKWQKSIV>!H3 

rrADAMTS-E (321) GRRDVEQYVLAIMNITRSLLFLGGQVaKLFQDSSLGNIVNII,VTRLILLTEDQPTI.EITHHAGKSLDSFCKWQKSI VSHS 
Consensus (321) GRRDVEQYVLAIMNI AKLFQDSSLG VNILVTRLILLTEDQPTLEITHHAGKSLDSFCKWQKS I V HS 

401 480 
hADAMTS-E (372) GHGNAIPEKGVAKHDTAVLITRYDICIYKNKSCGTI^LAPVGGMCERERSCSVKEDIGUITAFTIAHEIGHTFGMMHDGV 
mADAMTS-E (401) GKGNAIPENGVANHDTAVI.ITRYDICIYKNKPCGXLGiAPVGGMCERERSCSINEDXGLATAFTIAHEIGHTFGMNHDGV 

Consensus (401) GHGNAIPENGVANHDTAVLITRYDICI YKNKPCGTLGLAPVGGMCERERSCS NEDIGLATAFTIAHEIGHTFGMNHDGV 

481 55C 
hADAMTS-E (452) GNSCGARGQDPAKLMAAHITMKTNPFVWSSCSRDYITSFLDSGLGLCLNNRPPRQDFVYPTVAPGCJAyDADEQCRFQHGV 
rrJ^DAMTS -E (481) GNGCGARGQDPAKL^^UU^ITMKTNPFVWSSCSRDyIT•SFLDSGI.GIrLNNRPPRQDFVYPTVAPGQAYDADEQCRFQHGV 

Consensus (481) GN CGARGQDPAKLMAAHITMKTNPFVWSSCSRDYITSFLDSGLGLCLNNRPPRQDFVYPTVAPGQAYDADEQCRFQKGV 

561 S'^- 
hADAMTS-E I 532 ) KSRQCKYGEVCSELWCLSKSKRCITSSXPAREGTLCQTHTXDKGHCYKRVCVPFGSRPEGVDGAHGPWTPWGDCSRTCGG 
mADAMTS-E ( 561 ) KSRQCKYGEVCSELWCLSKSNRCITNSIPAAEGTLCQTirnDKGWCyKRVCVPFGSRPEGVDGAWGPWTPWGDCSRSCGG 

Consensus (561) KSRQCKYGEVCSELWCLSKSNRCITNSIPAAEGTLCQTHTIDKGWCYKRVCVPFGSRPEGVDGAWGPWTPWGDCSR CC-G 

641 "20 
hADAMTS-E ( 612 ) GVSSSSRHCDSPRPTIGGKYCLGERRRHRSCirrDDCPPGSQDFREVQCSEFOSIPFRGKFYKWKTYRGGGVKACSLTCLA 
irADAMTS -E (641) GVSSSSRHCDSPRPTrGGKYCLGERRRHRSCNTNDCPPGSQDFREMQCSEFDSVPFRGKFYTWKTYRGGGVKACSLTCLA 
Consensus (641) GVSSSSRHCDSPRPTIGGKYCLGERRRHRSCNT DCPPGSQDFRE QCSEFDS PFRGKFY WKTYRGGGVKACSLTCLA 

721 300 
hADAM-^S-E ( 692) EGFNFYTERftAAWDGTPCRPDTVDrcVSGECKKV<K:DRVLGSBI»REI>KCRVCGGDGSfiCETIEGVFSPASPGAGYEDVV 
InADAMT S - E (721) EGFNFYTBRa2aWIX?EPCRPDTWCVSGECKHV<XiDRVLGSDLREDKCRVCGGtfflSACETXECT^ 
Consensus (721) EGFNFYTERAAAVVDGTPCRPDTVDICVSGECKHVGCDRVLGSDLREDKCRVCGGDGSACETIEGVFSPA PG GYEDVV 

801 38C 
hADAMT S - E (772) WIEKGSVHXFIQDI^SLSHlJiI.KGDQESLIiEGIiPGTFQEHRi.PIAffI7FQiRQGEDQVQSI.EALGPINASlI 
rpADAMTS - E (801) WIPKGSVHIFIQDiJ^LSLSHIJU.KGIKiESIiLEGLPGTPQPHKi.BIAGTSFHEKCiGPDQAQSr.EAMPINASLI IMVLAQ 
Consensus (301) WIPKG3VHIFIQDLNLSLSHLALKGDQESLLLEGLPGTPQPHRLPLAGTTF LRQGPDQ QSLEALGPINASLI MVLA 



hADAMTS-E 
mADAMTS-E 
Consensus 



hADAMTS-E 
mADAMTS-E 
Consensus 



(852) TBIPALRYRFNAPIARDSLPPYSWKYAPWTKCSAQCAGGSQVQAVECBHaLDSSAVAEHYCSAHSKLPKRQRACNTEPCP 
(881) AELPALHYRFNAPIfiRDALPPySKHyAP«TKCSA?»aGGSQVQVVECRNQI.DSSAVftPHyCSGHSKLPKRQRACNTEPCP 
(881) ELPAL YRFNAPIARD LPPYSWHYAPWTKCSAQCAGGSQVQ VECRNQLDSSAVAPHYCS HSKLPKRQRACKTEPCP 



961 



:04c 



(932) PDWWGRKSLCSRSCDAGVRSRSWCQRRVSAAEEKiiLDDSACPQPRPPV&EACHGPTCPPEWAALDWSECTPSCGFGLR 
(961) PDm?VGmSRCSRSCDAGVBSRSWCQRRVSAAEEKRiDDSAePQPRPPVIiEACQGPMePPEWATLDWSECTPSCGPGLR 
(961) PDWVVGNWS CSRSCDAGVRSRSVVCQRRVSAAEEKALDDSACPQPRPPVLEAC GP CPPEWA LDWSECTPSCGPGLR 



1041 



r.ADAMTS-E ! 1012 1 HRWLCKSADHRATLPPRHCSPAAKPPATMRCNLRRCPPARWVAGEWGECSAQCGVGQRQRSVRCTSHTGQASnECTEAL 

.tADAMTS-E ; 1041 ) HRWLCKSADQRSTLPEGHCLPAAKBPSTMRCNLRRCPPARWVTSEWGEC GLGQQQRTVRCTSHTGQPSRECTEAL 

Consensus ;i041) HRVVLCKSAD R TLPP HC PARKPP TMRCNLRRCPPARWV EWGEC G GQ QR VRCTSHTGQ S ECTEAL 

1121 1180 

r.ADAMTS-S ( 1092 ) RPPTTTQQCEAKCDSPTP-GDGPEECKDVNKVAYCPLVtKFQE'CSRAYFRQMCCKTCQGH 

( 11x7) R-PSTMQQCEAKCDSVVPPGDGPEECKDVNKVAYCPLVLKFQFCSRAYFRQMCCKTCQGR 

.'1121) R P T QQCEAKCDS P GDGPEECKDVNKVAYCPLVLKFQFCSRAYFRQMCCKTCQG 
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